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LARGE DEVIATIONS FOR RANDOM SPECTRAL 
MEASURES AND SUM RULES 



F. GAMBOA AND A. ROUAULT 



Abstract. We prove a Large Deviation Principle for the random spec- 
tral measure associated to the pair (J//v,e) where Hn is sampled in the 
GU E(N) and e is a fixed unit vector (and more generally in the f3 extension 
of this model). The rate function consists of two parts. The contribution 
of the absolutely continuous part of the measure is the reversed Kullback 
information with respect to the semicircle distribution and the contribu- 
tion of the singular part is connected to the rate function of the extreme 
eigenvalue in the GUE. This method is also applied to the Laguerre and 
Jacobi ensembles, but in thoses cases the expression of the rate function is 
not so explicit. 



1. Introduction 

Let U be a unitary operator in a Hilbert space Tt, and e be a unit cyclic 
vector (the span generated by the iterates (U n x) is H). The spectral measure 
associated with the pair (U, e) plays an important role and will be one of the 
object studied here. This measure is the unique probability measure (p.m.) fi 
on the unit circle T such that 

(e,U n e) = I z n dn(z) (n > 1) . 
Jt 

Assume further that dim 7i = N and that e\ the first vector of the canonical 
basis is cyclic for U. Let Ai, . . . , Xn be the eigenvalues of U (all lying on T), 
and let ipi, ... ,ipN be & system of unit eigenvectors. The spectral measure is 
then 

N 

^ = (i) 

k=l 

with 7Tfc := | (?/>fc , ei ) | 2 k = l,...,N. Notice that given the vector ip^ is 
determined up to a phase, but the number irk is completely determined. To 
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avoid confusion, we put an index w (for weight) to distinguish this measure 
from the classical empirical spectral distribution (ESD) defined by 

k=l 

When U is uniformly sampled from U(A) (the unitary group of order N) 
with the Haar distribution, it is well known that the joint distribution of 
(Ai, . . . , Ajv) has a density proportional to 

|A(Ai, . . . , Aat)| 2 

where A is the Vandermonde determinant (see for example [Meh04| ). Fur- 
thermore, ei is almost surely (a.s.) cyclic and (tti, . ■ ■ ,ttn) is independent of 
(Ai, . . . , Ajv)- Moreover, (ttx, . . . , vrjv) is uniformly distributed on the simplex 
S n = {(t!"i> ■ • • j ttjv) : vrfc > 0, (/c = 1, . . . , N), 7Ti + . . . 7rjv = 1}- As N tends 
to infinity, both sequences of random measures (jujj ■ ) and (/Xu ) converge 
weakly to the equilibrium measure, i.e. the uniform distribution on T. In 
a previous work ( [GR08] ). we have established a strong connection between 
the asymtotic properties of (/iw^) an d the random moment problem studied 
in [GLC04| and [CKS93] . In particular we proved that the sequence (li^) 
satisfies a Large Deviation Principle (denoted hereafter LDP), with speed N 
and good rate function given by reversed Kullback entropy with respect to the 
equilibrium measure (see the Appendix where we recall the definition of this 
quantity). For the sake of completeness, we also recall in the appendix the 
definition of a LDP and some useful results on large deviations. Notice that 
there is a quite important difference in the large deviation behaviour of (//w^) 
and (li^). Indeed, this last sequence of probability measures (p.ms.) satisfies 
a LDP with speed N 2 and with a rate function connected to the Voiculescu 
entropy (see for example [HPOOj l, To show a LDP for (/iw^) one may think 
of two kinds of proof. The first one, which could be called the direct way, 
uses the representation ([!]) |GR08j . Besides, it is possible to code a measure 
[x on T by the system of its Verblunsky (or Schur) coefficients, via the Favard 
theorem [Sim05bJ; they are also the canonical moments of fj, (see |DS97| for 
the definition). The second method uses this coding. It turns out that, under 

the Haar distribution, the canonical moments (c£ , . . . of jujjr are inde- 
pendent random variables (r.vs.) with explicit distribution depending on N. 
It is then possible in a first step to check the LDP on these variables and in 
a second step to lift the LDP and the rate function on the space of measures 
[LC05] . 

The precise form of the rate function can be explained, in the first method 
by the Dirichlet weighting of the random measure, and in the second method 
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by the Szegd formula, which enters in the class of the so-called sum rules. The 
same thing can be done for the Jacobi ensemble with the arcsine distribution 
(on [0, 1] or on [—2, 2]) playing the role of the uniform distribution on T (see 
[GR08] k 

In this paper we will focus on models of self-adjoint matrices and their 
extensions. If H is a self-adjoint bounded operator in a Hilbert space Ti and 
e a cyclic vector, the spectral measure is the unique p.m. /«onM such that 

(e,H n e) = [ x n dfi{x) (n > 1) . 
Jr 

If dim Ti = N and e\ is cyclic for H, the spectral measure is 

N 

H^=J2^Sx h (3) 

k=l 

with the same notation as above, except that, now, the eigenvalues are real. 

The first two models studied here leads to an eigenvalue distribution that is 
not almost surely (a.s.) supported by a fixed compact set. We will first study 
the /3-Hermite ensemble. It is a family extending the Gaussian ensembles 
(GOE, GUE, GSE). The second model considered is the /3-Laguerre ensem- 
ble that generalizes Wishart matrices. In both cases, we could expect that 
the sequence (fj^ ) satisfies a LDP with speed N and with a rate function 
given by the reversed Kullback entropy with respect to the limit distribution 
(respectively semicircle and Marchenko-Pastur distributions). Actually the 
difference with the unitary case comes from the problem of support. We prove 
results of the same flavour that those we previously obtained in the unitary 
case, but with an extra contribution in the rate function due to the singular 
part of measures. The third model studied is the /3-Jacobi ensemble in which 
the eigenvalues are confined in a compact set. 

The paper is organized as follows. The next section is devoted to the in- 
troduction of notation and models : topology on space of moments and real 
matrix models that we will study later. In Section [3l we discuss some rela- 
tionships between the random spectral measures and coefficients appearing in 
the construction of the associated random orthogonal polynomials. The LDP 
for real matrix models are studied in last two sections. The case of the (3- 
Hermite ensemble is completely tackled in Section [H Surprisingly, we manage 
to compute explicitely the rate function, with the help of a convenient sum 
rule |Kil07] . The /3-Laguerre and 0- Jacobi ensembles are studied in Section [5] 
and[6l Here, the rate functions are not so explicit. All useful distributions we 
work with are defined in Section 17.21 Furthermore, basic facts on LDPs are 
recalled in Section P7TT1 
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2. Notation and models 

2.1. Topology on moments spaces. Let M 1 be the set of all p.ms. on HI 
and let be the subset consisting of p.ms. on HI having all their moments 
finite. For \i 6 we set 



= / x k dfi(x) , k > 1 , 
Jr 



and m(^) = (mfc(/x)) fc>r As it is classical in moment problems, we consider 

the set A4^ as a subset of M[[X]], (the set of formal series with real coefficients), 
or equivalently as a subset of the set of linear forms on the space R[X] of 
polynomials with real coefficients, or eventually as a subset of R N . We may 
identify // either with 

• The formal series Yl™=o m n{li)X n , 

• The linear form on R[X] : P(X) ^ J R P(x)dn(x) 

• The sequence m(/i). 

We endow M.^ with the distance of convergence of moments: 

If A^ d denotes the subset of consisting in all p.ms. determined by their 
moments, then the mapping m is injective and continuous from M. l m d to M N . 



2.2. /3-Hermite ensemble. Let us recall some classical Gaussian matrix 
models and their extensions. 

• GOE(N) The diagonal entries are independent and A/(0;2/iV) dis- 
tributed and the non diagonal entries are independent up to symmetry 
and A/(0; 1/N) distributed. The joint density on 1R N of the eigenvalues 
is proportional to 

A(Ai, . . . , Aat) exp ~ A j • 

3 

The matrix of eigenvectors is orthogonal, so its first line is uniformly 
distributed on the iV-dimensional sphere, i.e. the vector (iri, . . . ,ttn) 
has the distribution Dirjv(l/2). 

• GUE(N) The diagonal entries are independent and A/(0;l/iV) dis- 
tributed and the non diagonal entries are independent up to symmetry 
and distributed as A/(0; 1/2N) + v^A^O; 1/2N) where both normal 
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variables are independent. The joint density of the eigenvalues is pro- 
portional to 

A(Ai,...,A 7V ) 2 exp-y ^A 2 . 

j 

The matrix of eigenvectors is unitary, so the first line is uniformly 
distributed on the TV-dimensional (complex) sphere, i.e. the vector 
("7Ti, . . . , 7rjv) has the distribution Dirjv(l)- 

If M is sampled from the GOE(N) or GUE(N), e\ is a.s. cyclic, the 
eigenvalues are a.s. distinct and then we will consider the (random) 
spectral measure /iw given by Q. 

We do not recall the definition of the symplectic ensemble GSE(N). 
Nevertheless, some of the previous objects may also be defined in this 
context. 

• More generally, it is now classical to consider a parameter (3 = 2(3' > 0, 
and a density in R proportional to 

|A(A 1 ,...,A Ar )|^exp-^^A J 2 . 

j 

This expression extends the above formulas so that (3 = 1 for the GOE, 
(3 = 2 for the GUE and (3 = 4 for the GSE. It is often called a Coulomb 
gas model and (Ai, . . . , \n) are called charges. 

Dumitriu and Edelman ([DE02j Theorem 2.12) found a matrix model 
for this distribution, i.e. a random real symmetric matrix whose eigen- 
values follows the above distribution. Moreover they proved that the 
corresponding vector (tti, . . . , itn) is independent of the eigenvalues 
and Dirjv(/3') distributed. A specific description of the matrix will be 
given in the next section. 

When N — ► oo, it is known that (/xi^) converges weakly to the 
semicircle distribution, and satisfies a LDP with speed N 2 and with a 
rate function connected to the Voiculescu entropy. 

2.3. /3-Laguerre and /3-Jacobi ensembles. 

• The classical Wishart real ensemble is formed by W = G l G with G a 
m x N matrix with independent AA(0, 2/N) entries. The joint density 
of eigenvalues is proportional to 

in „ m 

I a m\i TT -v §(JV— m+1)— 1 jS . 

|A(A)| J_J_A| exp-— ^A^ 

3=1 j=l 

and the distribution of weights (wi, . . . , Tr m ) is Dir/v(l/2). 
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This eigenvalues distribution is classicaly extended to the /3-Laguerre 
distribution of charges, with density proportional to : 

m atp m 

|A(A)| , nA m- m+ l)-l exp _^ EAj . 

3=1 3=1 

For this case, Dumitriu and Edelman ([DE02J Theorem 3.4) also gave 
a (real symmetric) matrix model and proved that the vector of weights 
(7Ti, . . . , 7r m ) are also independent of the eigenvalues and is Dir/v(/3') 
distributed. 

• The Jf3E(N;&,b) ensemble (with a > — l,b > —1) has been defined 
to extend the MANOVA ensemble known in statistics for 0=1 and 
(3 = 2. It is defined by a density of N charges on [—2,2] 

N 

\A( Xl , . . . ,x N )fl[(2 - Xj ) a (2 + Xj ) b . 

3=1 

Killip and Nenciu ( [KN04] ) found a matrix (real symmetric) model and 
proved that the corresponding vector of weights is again independent of 
the eigenvalues and Dun {ft') distributed. A variant is the J0E(N, a, b) 
ensemble where the charges are distributed on [0, 1] according to a 
density proportional to 

N 

\A(x 1 ,...,x N )\ l3 l[x h J (l-x j r, 

3=1 

In the matrix model, the weights have the same properties as above. 

3. Tridiagonal representations 

3.1. Spectral map. In this section, we will describe the Jacobi mapping 
between tridiagonal matrices and spectral measures. This mapping will be one 
of the key tools for our large deviations results. We consider finite size matrices 
corresponding to measures supported by a finite number of points and semi- 
infinite matrices corresponding to measures with bounded infinite support. 
The material of this section is largely borrowed from [Sim05a] , [Sim07j [Sim98] . 

If fj, is a probability measure with a finite support consisting of N points 
the orthonormal polynomials (with positive leading coefficients) obtained by 
Gram-Schmidt procedure from the sequence l,x,x 2 , . . . ,x N ~ l satisfy the re- 
currence relation 



xp n {x) = a n p n+ i(x) + b n p n (x) + a n _ip„_i(x) 
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for n < N — 1, where a n > for those n. In the basis {po,pi, ■ ■ ■ ,p n -i}, 
linear transformation x ^ xf(x) in L 2 (dfi) is represented by the matrix 



the 



Ju 



I b a 

a bi 

'•• 

Vo ..." 





at 



\ 







O.N-3 bpf-2 aN-2 

ajv-2 b N ^iJ 



(5) 



So, measures supported by N points lead to Jacobi matrices, i.e. N x N 
symmetric tridiagonal matrices with subdiagonal positive terms ; in fact, there 
is a one-to-one correspondence between them. Let be given such a Jacobi 
matrix J and let e\ be a cyclic vector. If /i is the spectral measure associated 
to the pair (J, ei), then J is nothing more than the representation of the 
multiplication by x in the basis of orthonormal polynomials associated to [i 
(and J = J^j). More generally, if is a p.m. on IR, with bounded infinite 
support, we may apply the same Gram-Schmidt process and consider the 
associated semi-infinite Jacobi matrix: 



Ju 



( bo ao 
ao h 
oi 

v 



o 

01 
&2 a 3 



J 



(6) 



Notice that again we have a^ > for every k. The mapping p ^ J a (which 
we call Jacobi mapping) is a one to one correspondence between p.ms on R 
having compact infinite support and this kind of tridiagonal matrices with 
sup n (|a n | + |6 n |) < oo. This result is sometimes called Favard's theorem (see 
|Sim05aj p.432). 

Furthermore, a compactly supported p.m. p is completely determined 
through the knowledge of all its moments m^fj,) for k > 1. So, an inversion 
formula for the Jacobi mapping may be performed by using J u to compute 
the moments of [i (see for example |Sim98]). Actually, there is a recursive 
procedure connecting successive moments with sucessive sections of the ma- 
trix. For a general Jacobi semi-infinite (resp. N x N) matrix A, let for 
j > 1 (resp. for j < N) the left top submatrix of A. It is known from [Sim98] 
formula (5.37), that if A is semi-infinite, we have the identity 



( ei ,A fe ei ) = (ei,(AW) k ei) , k=l,...,2j 



1. 



(7) 



It is straightforward that this formula holds true when A is a Jacobi N x N 
matrix, as soon as j < N and k < 2j — 2. When A 



J u , the Jacobi matrix 
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associated to a p.m. /x, we get, in terms of the moments : 

m k (n) = (ei,(j^) fc ei) , k = 1, . . . , 2j - 1. (8) 

for every j if /i as an infinite support, and for j < JV if /i is supported by iV 
points. Notice that this kind of formula leads to Gauss-Jacobi quadratures. It 
means that, there exists a sequence of polynomials f r of 2[N/2] + 1 variables, 
such that 

m r (n) = f r {b , . . .,6[ r / 2 ];oo, • • .,a[ r /2]-i) > ( 9 ) 

for any r if /i as an infinite support, and for r < 2iV — 1 if fi is supported by 
N points. 

Notice that the inverse relations are quite intricated (see for instance Simon 
|Sim98] Theorem A2). Actually, a n depends on mi, . . . , m2 n +2 and b n depends 
on mi, . . . ,m 2n +i- 



3.2. Tridiagonal representations of /3-ensembles. We now consider the 
Jacobi mapping for our random matrix models. The case of the /3-ensembles 
is directly obtained by the representation proposed by Dumitriu and Edelman 
( [DE02] ). 

• For the normalized G/3E this representation is 



(AO „(A0 



H (N) 







'0 



V 



'0 



AN) AN) 



AN) 



a 



(N) AN) 



N-3 



J N-2 

AN) 

L N-2 



o \ 



a (N) 

u N-2 
AN) j 
°N-1/ 



where the variables a 



(N) 

' 



AN) AN) 
)«7V-2' W ' 



j ^iv-d are independent and 



b<P ( = } AT(0; (/3'iV)- 1 ) 



(n) 



^(p'(N-l-j),(P'N)-i) 



(10) 



It means that Hp has the same joint distribution of eigenvalues as for 
the G(3E{N). Moreover the weights are independent of the eigenvalues 
and have the required distribution. 
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For the L/3E (N,m(N)) the representation is 



(dp o 

,(#) ,(iV) 



B 



(N) 



\ 










(AO 





o \ 



AN) 

m(N)—\ 

(AT) 
3 m(N)-l 



A N ) 
l m(N) / 



where the variables d\ 
and 

(AT) W 



d 



(N) AN) 



m(iV)' S l 



(AT) 



(A/") 
S m(AT) 



x are independent 



7^(m(^)-i) > (W _1 



• The representation of the J(3E(N; a, b) has been obtained by Killip 
and Nenciu ([KN04]). Actually, they consider a measure /x on [—2,2] 
with finite support as the projection of a symmetric measure ft on 
the unit circle T = {z : \z\ = 1} by the mapping z i— > z + z _1 . 
The Jacobi parameters (ao, • • • ; bo, ■ ■ ■ ) of \i are in bijection with the 
Verblunsky coefficients (ao, • • • ) of fi by the Geronimus relations (this is 
also true for measures with infinite support, sec [Sim05a] section 11)). 
Notice that choosing a probability distribution to sample Verblunsky 
coefficients leads to a probability distribution on Jacobi matrices. 



Theorem 3.1 (Killip-Nenciu, Theorem 2). Given (3 > 0, let a k 
k < 2N — 2 be independent and distributed as follows: 



(AT) 



< 



a 



(AT) (d) 
2p - 



& ((iV - p - 1)/?' + a + 1, (JV - p - l)/3' + b + l) , 



4^1 = ft {{N-p- 1)/?' + a + b + 2, (N - p)(3' 



for p = 0, . . . , N — 1. Let a 



(AO 
2AT-1 



ft 



(AT) 



1 and defin$\ 



d*0 



(! - a fk-i) a 2k - (1 + « 



AN) 



(i-«^ 1 )(i-K^) 2 )(i + <i 1 ) 



(TV), 



2fc-l/°2fc-2 
AN) 



(12) 



(13) 



^these are the Geronimus relations 
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Then the eigenvalues of the tridiagonal matrix A^ N ' built with these 
coefficients ajj. and have a joint density proportional to 

N 

\A(x 1 ,...,x N )f~[[(2-x j ) a (2 + x j ) h 
i=i 

and the vector of weights is Dir(P') distributed. 

We call J(3E(N, a, b) ensemble the above distribution on tridiagonal 
N x N matrices. 

Since it is often convenient to work on [0, 1] instead of [—2, 2], let us 
introduce the affine mappings : 

x £ [0, 1] A 4x - 2 (14) 
yG [_ 2 ,2]A^ (15) 

We call JPE(N, a, b) the image of J(3E(N, a, b) by s. The preceding 
result may be rephrased in the following way: 

Corollary 3.2. If A^ N ^ is sampled in the J(3E(N, a, b) ensemble, its 
eigenvalues have a joint density proportional to 

N 

\A(x l ,...,x N )fHx h J (l-x J T, 

and the vector of weights is Dir(P') distributed. 

4. Large Deviations in the /3-Hermite ensemble 
4.1. Introduction. Recall that the sequence of ESD 



k=l 

satisfies the LDP with speed (5'N 2 and good rate function 

/» = -E(fi) + / ^d»(x) + K H , 
Jr 1 

where Kh is a constant (sec [BAG97J) and 

S G") = / / log |x - y\ dfi(x)dfi(y). 
J Jr 2 

The equilibrium measure, unique minimizer of / u , is the semicircle distri- 
bution (denoted hereafter SC, see Section I7T2]) . In particular, the sequence 

(A*u ) converges weakly in probability to SC. 
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To manage the large deviations of (/j^j ), we will first tackle the large 

deviations of (a^ib^, k > 0). It is important to notice already that, in 

view of (fTUj) . as N — > oo, we have for fixed k > , a k — ► 1 and b k — > (in 
probability). The corresponding infinite Jacobi matrix which satisfies 

b k = , a k = 1 , > , 

(often called the free Jacobi matrix, see Simon ( }Sim05bj p. 13) is with 
H = SC. 

In the large deviations properties of the extremes eigenvalues will 

play an important role. As a matter of fact, the following function will appear 
in our rate function. Let, for x > 2 



F G ( X ) = J ^/t 2 -4dt = 4 J 2 \/t 2 -ldt 



xr 



= - \J x 2 - 4 - 2 log 

2 \ z / 

Further, for x < —2 set J~g( x ) = J~g{— x). The following theorem gives the 
large deviations properties for the largest eigenvalues in the G(3E(N) model 
frame. 

Theorem 4.1. For the G0E(N) model the S6QU6TIC6 (AmaL 

) satisfies for x > 2 



lim — log P(AW > x) = -^ G (x) . (16) 

The statement and proof for the GOE are due to {BADGOlj Theorem 6.1, 
the case GUE is in [Mai07j Prop. 3.1 and the general case, although not treated 
completely is quoted in Feral [Fer08]. Actually, the proof depends only of the 
explicit form of the joint distribution of charges and is easily extended. 

To prepare the statement of our main result, we need another definition. 

Definition 4.2 (Simon). We say that a p.m. fj, onM satisfies the Blumenthal- 
Weyl condition (B.W.c) if 

i) Supp(^) = [-2,2]U{^}^T 1 U{ J B+}^ = + 1 where N + (resp. N~ ) is either 
0, finite or infinite, 

£f < E 2 < • • • < -2 and E\ > E% > • • • > 2 

are isolated points of the support. 
ii ) If N + = oo (resp. N~ = oo) then E^~ converges towards 2 (resp. E~ 
converges towards —2). 
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4.2. Main result. Here is our main result. Notice that, of course, SC is the 
unique minimizer of the rate function, in accordance with the remark at the 
beginning of this section. 

Theorem 4.3. The sequence ) satisfies the LDP in M.^ d with speed 

(5' N and good rate function 

toa|i/) + En=l^G(^n)+En=l^G(^n) if u satisfies B.W.c, 
1 +oo otherwise . 

(17) 



Hence, the rate function I{v) is finite if and only if 

N+ N~ 
v{dx) = f a (x)SC(dx) + v s {dx) + K n 5 E + {dx) + K n S E - (dx) 

n=l n=l 

where v s is singular (with respect to the Lebesgue measure) and is supported 
by a subset of [—2, +2] and 

2 N+ N- 

- / log f a (x)SC(dx) < oo , Y,?g{K) + Y, J7 g(E-) < oo. 

^ ~ 2 n=l n=l 

In this case 

r 2 ( /I 2 \ /7 2 N+ N ~ 

Proof : For k > 0, the subset M(k) of of all p.ms supported by [—k, +k] 
is compact for our topology. Indeed, for p.ms in M(k) the moment maps are 
continuous function (M(k) is tight for the convergence in law). From Theorem 
14.11 we know that 

lim lim ^-logP(A^ >k) = -oo. 



By symmetry, we have also 



lim lim — logP(A^ < ~k) = -oo. 

k^oo N^co iv 

This implies 

lim lim — logPfrtJW t M(k)) = -oo , 
hence the sequence (fJ^ N ^)pf is exponentially tight. 

From the inverse contraction principle (see [DZ98] Theorem 4.2.4 and Re- 
mark a)) it is a consequence of the two following theorems: the first one is 
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a LDP for the sequence of moments and the second one is a magic formula 
which allows a powerful identification of the rate function. ■ 

We now give one of the main ingredients of our LDP proof for G/3E(N) en- 
sembles. First define the functions g(x) := x — 1 — log x if x > and g(x) = oo 
otherwise and let 

g(x 2 ) if x > 
oo otherwise . 



G(x) :-- 



Theorem 4.4. The sequence (m(fj^')) satisfies in R N the LDP with speed 
(3'N and good rate function I defined as follows. This function is finite if and 
only if there exist (bo, . . . ; ao, • • • ) G K N x (0, oo) N satisfying 

oo oo 

^6, 2 <oo, ^( aj -l) 2 <oo (19) 

such that m r = (e\,A r ei) for every r > 1 with A infinite tridiagonal matrix 
built with (6q, . . . ; ao, • • • )• In that case 



1 x ' 
/(mi, . . .) = ~J2 + G (%) < 00 ■ 



2 

j=0 3=0 

Theorem 4.5 (Killip-Simon [KS03j . |Sim05cj Theorem 13.8.6). /ei J be a 

Jacobi matrix built with (ao, • • • ; feo, • • • ) G (0, satisfying sup[a n + 

sup|6 n |] < oo. Let /i be the associated measure obtained by Favard's theorem. 
Then 

[b 2 k + (a k - l) 2 ] < oo (20) 

k 

if, and only if, the p.m. fi satisfies B. W.c. and the two following conditions: 

N+ N- 

^-2) 3 / 2 + ^(-2-^)3/ 2<00 ( 21) 

3=1 3=1 
f2 



J \og{f a {x)WA-x 2 dx>-<x. (22) 



In that case 



h 2 n + G(a n ) 



(23) 



where both sides may be (simultaneously) infinite. 

The proof of Theorem 14.41 will use the following result. 
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Lemma 4.6. For fixed k, (6q , . . . , bj^ ; Oq , . . . , cl^_i) N>k satisfies in M? k ~ 1 
a LDP with speed (3'N and good rate function 

j k fe-1 

I k (b , ... ,b k ;a ,... , a k -i) = - ^ bj + ^ . (24) 

j=0 i=o 

Proof It is an immediate consequence of independence and the LDP for 
Gaussian and Gamma r.vs. recalled in the following lemma. ■ 



Lemma 4.7. 

(1) The sequence of distributions AA(0;n -1 ) satisfies the LDP with speed 
n and good rate function x i— ► x 2 /2. 

(2) For a > and c fixed, the sequence of distributions r y((n — c), (an) -1 ) 
satisfies the LDP with speed n and good rate function x i— ► g(ax). 

(3) For it, v > and 5, <5' /ixed, f/ie sequence of distributions (3 s (un+5, vn+ 
5') satisfies the LDP with speed n and good rate function: 

h ^ = \ ( i(u-v)-ulog(l + q)-vlog(l-q) ; <?e(-M) ^ 

I oo ; otherwise . 

Proof The points 1 and 2 are well known. For point 3, we use the represen- 
tation 

, (d) j(un + 5)- jjvn + 5') 

(3 s (un + 8,vn + 5) = — — — 

7(nn + o) + 7(un + o'J 

hence by contraction the rate function is 

x — y 

h(q) = M{ug(x/u) + vg(y/v); — ■ — = q} , 

x + y 



which yields easily (|25j) . 



Proof of Theorem 14.41 : Fix I > 1. By Lemma 14.61 and the contraction 
principle, the sequence (mi . . . , rn,2g-i(fA )) satisfies the LDP in M 2 ^ 1 

with speed (3'N and rate function I^t—i defined as follows. Notice that there is 
at most only one tridiagonal matrix Ai built from (bo, . . . , be-i; clq, . . . , ai-2) 
as in ([5]) such that 

m r = (e 1 ,A r e e 1 ), r = 1, . . . , 2£ - 1. (26) 
Hence, if (mi, . . . , m^-i) satisfies ([26]) . then 

^-i(mi,...,m2<-i) = -^-i(&Q>- ■ ■ >&£-i;oq,. . . ,a£_2) ( 27 ) 

Otherwise, /2£-i( m i> • • • > TO 2£-i) is infinite. We do not consider the even case 
since there is no injectivity in that case. 
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We now apply the Dawson-Gartner theorem. Let us endow M[LY]] with the 
topology of pointwise convergence of coefficients. It can be viewed as the 
projective limit 

R[X] = limRfcpf] 

where Rfc[X] is the set of polynomials of degree equal or less than k. 
The rate function is 

/(mi,...) =sup{I 2 fc + i(mi,...,m 2 fc + i) : k > 0} . (28) 

It is clear that 

j k k-1 

sup{7 2&+ i(mi, . . . ,m 2 k+i) ■ k > 0} = sup{- ^ b] + ^ G(a,j)} 

k j=0 j=0 
_. oo oo 

j=0 j=0 



4.3. Failure of the direct method. Mimicking the unitary case ([G R08] ). 
it is tempting to define the random measure 

N 
k=l 

with the Yfc independent and j((3') distributed so that 



~(N) 

(TV) 



The problem is that the method of Najim |Naj02| cannot be applied as the 
important assumption on the range of the eigenvalues is violated. Indeed, 
not all the eigenvalues ly in the support of the semicircle law. So that, the 
conclusion given by this approach is wrong. The rate function candidate only 
contains the Kullback part of the LDP but loose the outer part. 



4.4. Optimization of the rate function. In this subsection we discuss the 
problem of minimizing the rate function I v under moment constraints. Sur- 
prisingly, using basic tools of large deviations, we show that this optimum 
value may be expressed in terms of the coeficients (a^) and (&&). This result 
has the same spirit as the relation between the optimum of Burg entropy under 
trigonometric moment restrictions and the Verblunsky coeficients (see [GS84 
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and |Bur75] ), To begin with, for I > 1, we define the moment sets 



M 2 ;_! := x^{dx)^j r^e-A^j, 

m 2~i-? ■= \ yj x j n(dxYj : n G M\ Supp/i C [-2,2] 



j=l,..,2Z-l 
[—2 2l [ 2 2l 

For c G M 2 /_i (resp. c G ML ^ ), let <S 2 z-i(c) (resp. (S^-i (c)) denotes the 
set of all p.ms on IR (resp. on [—2,2]) having c as 21 — 1 first moments. 

Corollary 4.8. 

1) Let c G /nt M 2 z_i 7 i/ien 

Where as before, the real (0}).?'=l,— ,i-2 anc ^ (ftj)j=i,...,z-i i/ie coe/i- 
cients of the tridiagonal matrix ([5]j. 

2) Lei c G Int (M^f 1 ) i/ien 

inf = inf X:(5C|M) = ^6| + 2G(a i ) 



21-1 r 2 21-1 

sup ( vq + V] VjCj + / log ( 1 — vq — VjX j ) SC(dx) ) , 

(^)6lR 2i V j=1 J-2 X J=1 ' 



(here log(-u) = — oo whenever u < 0). 

Proof The proof of the first part directly follows from the contraction prin- 
ciple (see [DZ98J). The first equality of the second part is obvious as Tq is 
non negative. The last equality is a direct application of general optimization 
procedures developed in |GG97j or [BL93]- ■ 



5. Large Deviations in the /3-Laguerre ensemble 

In the Laguerre case, in the usual asymptotics ./V — > oo, m(N)/N — > r < 1, 
we observe similar phenomena. Recall that the sequence of ESD 

m(N) 
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satisfies the LDP with speed f3'N 2 and good rate function 

/» = -E(/i) + j™ (| - (1 - r) log x) d/*(x) + , 

where i^x is a constant ([HP98J). The equilibrium measure, unique minimizer 
of / u is the Marchenko-Pastur distribution of parameter r (denoted hereafter 

by MP, see Appendix). In particular, the sequence (/iu ) converges weakly in 
probability to MP. 

To manage the large deviations of (/ii ), we will first tackle the large 
deviations of (s^' m ,d^' m , k > 0). Recall that the elements of the tridiagonal 
matrix L^' are 

bi N) = (4 N) f , € ] = rf^ + O 2 (i<*<^-i) 

4 N) = -fficffi (0<^<iV-2). (29) 
We can see already that, in view of (llljl . we have for fixed k > 1 and iV — > 



oo, limc?^ V ' 1 = 1 and limsjj^ = ^fr (in probability). From (|29p . this yields 

limftg^ = 1 and for fixed k > 1, lim&jj. = 1 + t , lima^j = y^r (in 
probability). The corresponding infinite Jacobi matrix which satisfies 

b = 1 , b k = 1 + t , (fc > 1) ; a fc = v 7 ^ ; (^ > 0) . 

is J^j with = MP. 
Let J~l defined by 



Fdx) 



,>« t) , 



f' V(a(r)-t)(i(r)-i) ^ 

/ — ai < x < a(r) . 

Jx t 



This is actually the rate function of the LDP for the extreme eigenvalue, as 
quoted in Feral |Fer08| . 

Proposition 5.1. For the L(3E(N,tN) model, 

(1) the sequence (XmJ x ) satisfies for x > 6(r) 

lim -L logP(AW >x) = -F L {x) . (30) 

(2) the sequence (A^2j satisfies for < x < a(r) 

l tw log¥{x ™ - x) = • (31) 
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For a general double sequence of positive numbers (d k )k>i an d (sk)k>i we 
set d o s = (di, . . . ; si, . . . ). We deduce the elements 

6 = 4 . h = 4 + d l+i (k > 1) 

a k = s k+1 d k+1 (k>0). (32) 

Conversely, if (do, • • • ; bo, . . . ) is given in (0, oo) NxN such that the tridiagonal 
matrix is positive, there exists a unique do s satisfying (|52"j) . Here is a direct 
consequence of Lemma 14.71 

Theorem 5.2. Under the L[3E{N,tN) model, the sequence (/il ) satisfies 
in A^i^ j((0, 00)) a LDP with speed f3'N and good rate function I v defined 
as follows. This function is finite at v if and only if there exists d o s £ 
[0, oo) N x [0, oo) N (necessarily unique) satisfying 

G(d k ) + G(s k /VT) < 00 , 

k k 

such that m r (v) = {ei,A r ei) for every r > 1 with A infinite tridiagonal matrix 
built with (bo, . . . ; ao, ■ ■ ■ ) satisfying HM) . In that case 

/» = G« + r Y, G{s k /V^) ■ (33) 

k k 

Remark 5.3. 

• It is clear from (33$ that the unique minimizer of I v corresponds to 
d k = 1 and s k = ^fr which corresponds to MP. 

• When t = 1, we can write: 

/» = £[4-i-iog4 + 4-i-io gs £] 

k>l 

= d 2 1 -l + Y,[dl +1 + s 2 k -2]-2^og(d kSk ) 

k>\ k>l 

= 6 -l + ^(6 fc -2)-2^1oga fc . 

k>l k>0 

This expression of P in terms of the Jacobi coefficients makes plausible 
the existence of a convenient sum rule and we propose the following 
conjecture : 

Conjecture 5.4. The rate function is 

F{v)=1C{MP\v) + Y,FL{Ef). 

j 
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Proof of Theorem 15.21 For k fixed, (d k ) (resp. (s k )) satisfies a LDP 
with good rate function G(dk) (resp. rG(sk/ 'y/r)) hence, by independence, 
the rate function is the sum (f33|) , ■ 



6. Large Deviations in the /3-Jacobi ensemble 

Let us consider the J(3E(N,ai(N),b(N)) ensemble. The usual asymptotics 
is N -> oo, b(N)/N -► F Kl , a(N)/N -» /3'k 2 - The sequence of ESD 

1 N 

k=i 

satisfies the LDP with speed /3'JV and good rate function : 

/ u (/i) = -S(/i) - / («! log x + k 2 log(l - a)) dn(x) + Kj , (34) 
Jo 

where Kj is a constant (see [HP05] ). The equilibrium measure (the unique 
minimizer of I u ) is the Kesten-MacKay distribution (denoted hereafter KMK) 
of parameter (ti_,u+), where 

/ 1 + K\ 1 + K\ + K2 

(see Section [7^2]) . In particular, the sequence (fi^) converges weakly in prob- 
ability to KMK. 

To manage the large deviations of (/iw ), w e will first tackle the large 
deviations of (ck^ , k > 0). It is important to notice already that, in view of 
(fT2j) . we have for fixed p > 0, 

,. (A) K 2 -Kl ,. (Af) Kl + K 2 

hm a,„ = , hm a 0n U = ■ 

N 2p 2 + Ki + K 2 N 2p+1 2 + Kl + K 2 

The symmetric measure admitting these limiting Verblunsky coefficients is 
well understood by its Cauchy-Stieltjes transform since the work of Geronimus 
QGcr44], see also the books of Simon). We do not give details here to shorten 
the paper. After projection, we obtain the KMK distribution. 

Let Tj defined by 



x J(t-U+)(t-U-) , 

x 1 -L dt u + < x < 1 



t(l-t) 



— at < x < n_ . 



t(l-i) 

This is actually the rate function of the LDP for the extreme eigenvalue. 
Indeed as in Feral |Fer08] . we can prove 
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Lemma 6.1. For the J(3E(N,a(N),b(N)) model with the above notations, if 

Kl,K 2 > 0, 

(1) the sequence (Xmax) satisfies for x G (u+, 1) 

lim-^logP(AW > x) = -Tj(x) . (35) 

(2) the sequence (A^J satisfies for x £ (0, «_) 

lim ^ log P(A^ <x) = -Tj(x) . (36) 

Proof We follow the scheme of proof of [Fer08] . Section 5.2. The potential 
is 

V(x) = —Kl log X — K2 log(l — x) . 

It is convex and C°° on the compact support of the equilibrium measure. It 
fulfills assumption (iv) of Theorem 4.2 therein. Besides from the formula (|46p 
giving the Cauchy-Stieltjes transform, we see that we are exctly in the same 
situation as in Feral. ■ 



Theorem 6.2. 

(1) ( G amboa- Rouault [GR08],) Under the J r (3E(lS , a, b) model, the sequence 

(/xl^) satisfies in A / J 1 ([0, 1]) endowed with the weak topology the LDP 
with speed N and good rate function 

I(y) = K,(ARCSINE \ v) . 

(2) Under the J(3E(N, K\N, k 2 N) model, the sequence (fJ-i N ^) satisfies in 
A4 1 ([0, 1]) endowed with the weak topology the LDP with speed N and 
with a good rate function I w defined as follows. This function is finite 
at v if and only if there exists a £ (—1, 1) N (necessarily unique) such 
that 

CO 00 

1(a) := (ki - k 2 ) "2fc + (ki + ^2) X! " 2fc + 1 


00 00 

- (1 + Ki) log(l + a 2k ) - (1 + K2) 1 °g( 1 ~ a 2fc) 



00 00 

- (1 + Kl + K 2 ) Y l ° g ^ + "2A:+l) - Y ~ °2fc+l) 



is finite. In that case 

i»=J(a). 



LARGE DEVIATIONS FOR RANDOM SPECTRAL MEASURES AND SUM RULES 21 

Proof We apply Lemma 14.71 (3), with n = j3'N, and for an even index we 
have u = I + K\,v = I + K2 and with odd index u = 1 + k\ + K2, v = 1 

I Of 2k {x) = x(k X - K 2 ) - (1 + Kl)log(l + x) - (1 + K 2 )log(l -X) 
I a 2 k+i( X ) = X ( K 1 + K 2) - (1 + «1 + «2) log(l + X) - log(l - x) 

Then it is enough to add up. 

In the particular case of a and b fixed, we have K\ = k,2 = and 

oo 

i(dO = -J> g (i-4). 



But the Szego formula ([Sim05b]) says that it is exactly the reversed Kullback 
with respect to the ARCSINE distribution. ■ 

In the general case, there is up to our knowledge, no known sum rule. 
Besides it is very intricate to express the above sums in terms of the tridiagonal 
coefficients. Nevertheless it is tempting to propose the conjecture. 

Conjecture 6.3. Under the J(3E(N, KiN, K2N) model, the rate function is 
given by 

I{v) = K{KMK I v) + ^Tj{Ef ) 

3 

7. Appendix 

7.1. Large deviations. We present here large deviations principles (LDPs) 
for sequences of random measures defined in the previous Section. For the sake 
of completeness we briefly recall the LDP definition. Let (u n ) be a decreasing 
positive sequence of real numbers with lim n _>oo u n = 0. 

Definition 7.1. We say that a sequence (Rn) of p.ms. on a measurable 
Hausdorff space (G, 13(G)) satisfies a LDP with speed u^ 1 and rate function I 
if: 

i) I is lower semicontinuous (Isc), with values in M + U {+00}. 

ii) For any measurable set A of G: 

—I(mtA) < liminf mat log i? at (^4) < limsuptijv log-R/v(^4) < —I (do A), 

where 1(A) = inf£ gy i/(£) and mi A (resp. do A) is the interior (resp. 
the closure) of A. 

We say that the rate function I is good if its level set {x £ G : I(x) < a} is 
compact for any a > 0. More generally, a sequence of G -valued r.vs. is said 
to satisfy a LDP if the sequence of their distributions satisfies a LDP. 
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The rate functions obtained in this paper are built with be the so-called 
Kullback information or cross entropy. Let P and Q be p.ms. on G, the 
Kullback information between P and Q is defined by 

( r dP dP 

KVTO =/ G lo g ^P ifP«Q and log - £ L'(P), ^ 

= +00 otherwise. 



7.2. The distributions. 

7.2.1. Gamma distribution. For a, b > 0, the 7(0,6) distribution is supported 
by [0, 00) with density 

e -x/b x a-l 



b a T(a) 

Its mean is ab. 

7.2.2. Beta distribution. For a, b > 0, the beta symmetric distribution of pa- 
rameter (a, b), denoted by f3 s (a,b), is supported by (—1, 1] and has density 

~l-a-b T ( a + b ) n „\a-m , „\6-l 



r(o)r(6) 



(l-xf-^l + x)' 



Its mean is j^. 



7.2.3. Dirichlet distribution. For A; > 1, we set 

Sk ■= {(xi, ■ ■ ■ , x k ) : Xi > 0, (i = 1, ■ ■ ■ , fc), xi H h x fc = 1} 

^ := - - - > x k) ■ Xi > 0, (i = 1, • • • , k), x 1 H h x fe < 1} . 

Obviously, the mapping (x\, • • • , x^+i) 1— > (xi, • • • , x&) is a bijection from the 
simplex onto 55- 

For aj > 0, j = 1, . . . , k + 1, the Dirichlet distribution Dir(ai, • • • , afe+i) on 
has the density 

r(tti + • • • + Ofc+i) ! a fc+ i-l m 81 

r(a 1 )-.-r(a fe+1 ) Xl ••• Xfe + 1 (d8j 

with respect to the Lebesgue measure on Sk+i- When a± = ■ ■ ■ = ak+i = a > 
0, we will denote the Dirichlet distribution by Direct). If a = 1 we recover the 
uniform distribution on S^. 
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7.2.4. Semicircle distribution. The semicircle distribution denoted by SC is 
supported by [—2, 2] with density 



X 



2 



2vr 

Its Cauchy-Stieltjes transforu0 is 

f dfi(x) -z + y/z 2 -4 
J x- z 2 

When < r < 1, the Marchenko-Pastur distribution, denoted by MP is 
supported by (a(r),6(r)) where a(r) = (1 — a/t) 2 , 6(r) = (1 + \/t) 2 with 
density 



27TTX 

Its Cauchy-Stieltjes transform is 



y/{x - q(t))(6(t) - a;) 



, -z+(l-r) + V(0-l-r)2-4r 
m(z) = 1— . (41) 

7.2.5. Kesten-McKay distribution. The Kesten-McKay distribution is supported 
by (u_ , u + ) with < u_ < u + < 1 and its density is 

y/( g - - x) 

2ttx(1 — x) 

where 



Cu., u+ ^\-ri, ' (42) 



1 



To express its Cauchy-Stieltjes transform, let us give some notation. For 
(6,c) G (0,1) x (0,1) we put 

a ± (b, c) = X - [l + Vb~c ± ^(l-6)(l- c )] , (43) 

and for (x, y) G (0, 1) x (0, 1) 

u±(x,y) = (l — x — y + 2xy) ± 2 y 7 x(l - x)y(l - y) 

V(l-*)(l-y)±yiy) 2 • ( 44 ) 
The mappings a± and u± are inverse in the following sense : 

(<T_ ,(T+) 

{(6, c) : < 6 < c < 1} s {(x, y):0<x<y<l and x + y > 1} (45) 



Throughout, all branches of the square roots are taken in accordance to the definition 
of Cauchy transform 
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The Cauchy-Stieltjes is then (see for instance [Dem06j p. 129, or [CC04| p. 425) 

(o+ - C7_) 



m(z) 



(1-O-H 



<7_ 



+ 



2(1 -a+)z 2(l-a+)(l-z) 
ARCSINE corresponds to ii_ = and u + = 1. 



+ ^gEp£E^±j . ( 46) 

2z(l — z) 
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